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Abstract 

We propose modified frequentist definition for the determination of confidence 
intervals for the case of Poisson statistics. Namely, we require that 1 — j3 > 
^ P(n\X) > a . We show that this definition is equivalent to the Bayesian 

n=o 

method with prior vr(A) ~ X k . We also propose modified frequentist definition for 
the case of nonzero background. 
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In high energy physics one of the standard problems [1] is the determination of the 
confidence intervals for the parameter A in Poisson distribution 

P(n|A) = -exp(-A) (1) 

from the observed number n obs of events. There are two methods to solve this problem 
- the frequentist and the Bayesian. In Bayesian method [1, 2] due to Bayes theorem the 
probability density for the A parameter is determined as 

ni x _ P{n obs \X)7Y{\) 

Jo {P{n obs \\ )7r(A Jet A 

Here 7r(A) is the prior function and in general it is not known that is the main problem 
of the Bayesian method. Formula (2) reduces the statistics problem to the probability 
problem. At the (1 — a) probability level the parameters X up and Xdown are determined 
from the equation 1 

r^up 

/ p(X\n obs )d\ = 1 - a (3) 

own 

and the unknown parameter A lies between Xdown and X up with the probability 1 — a. The 
solution of the equation (4) is not unique. One can define 



roc , 

/ p(X\n obs )dX = a , (4) 

/ p{X\n obs )dX = (3 . (5) 
Jo 

In general the parameters a and are arbitrary except the evident equality 

a + (3 = a . (6) 

The most popular are the following options [1]: 

1- X down = - upper limit. 

2- X up — oo - lower limit. 

3. / p(X\n obs )dX = / p(X\n obs )dX = — symmetric interval. 

-'0 J ^doWTL 2 

4. The shortest interval - p(X\n obs ) inside the interval is bigger or equal to p(X\n obs ) 
outside the interval. 

lT Jsually a is taken equal to 0.05. 



In frequentist approach the Neyman belt construction [3] is used for the determination 
of the confidence intervals. Namely, for each A we require that 

n+(A) 

E P{n\\)>l-a. (7) 

n=ri_(A) 

For the observed number of events n b s the equation (7) allows to determine the con- 
fidence interval [Xdown(n b s ) , X up (n b s )} of possible parameters A. Note that as in Bayesian 
approach the choice of n + (X) and n_(A) is not unique. In general we have the following 
equations [4, 5, 6] for the determination of Xdown and X up : 



oo 



J2 P(n, Xdown) =/?', (8) 

n obs 
^obs 

P(n, X up ) = a , (9) 



n=n oba 



n=0 

where 

a' + = a. (10) 

In this paper we show that the modified frequentist definition of the confidence interval 
is equivalent to the Bayesian approach. Consider the probability to observe the number 
of events n < n b s 

P_K 6s |A) = xJp(n|A). (11) 

ra=0 

To determine possible values Xd OW n and X up of the confidence interval we require that 

1-/3' >P-(nobs\X) >a, (12) 
where a + = a. The equations for the determination X up and Xdown have the form 

P-(n obs \X up ) = a , (13) 

P-{n ob s\Xdown) = 1 -/?'• (14) 

Note that as in the case of Bayesian approach the choice of a and is not unique. Due 
to the identity [6] 

P-(n obs \X) = / P(n obs \X)dX (15) 

J A 
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[ XuP P(n obs \\')d\' = 1 -a-0 . (19) 



the confidence interval [Xdown, X up ] is determined from the equations 

a = P{n obs \\ )dX , (16) 

(3 = / P(n obs \X)d\ . (17) 
Jo 

The parameter A lies in the interval 

Xdown < A < X up (18) 

with the probability (1 — a — 0). Due to equations (16,17) our modified frequentist 
definition (12) is equivalent to Bayes definitions (3,4,5) with flat prior 7r(A) = 1, namely: 

own 

The coverage of the definition (12) means the following. For a hypothetical ensemble of 
similar experiments the probability to observe the number of events n < n obs satisfies the 
inequalities (12). As we noted before the choice of Xdown and X up is not unique. Probably 
the most natural choice is the use of the ordering principle. According to this principle we 
require that the probability density P(n b s \X) inside the confidence interval [Xdown, X up ] is 
bigger or equal to the probability density outside this interval. For Poisson distribution 
this requirement leads to the formula 

P{ n obs\Xdown) — P(n o b s \X up ) (20) 

for the determination of X up and Xdown- For such ordering principle a and are not 
independent quantities. It is natural to use a = a + as a single free parameter. 
Note that the equations (9) and (13) for the determinations of an upper limit X up in 
frequentist and modified frequentist approach coincide whereas the equations (8) and 
(14) are different. Namely, the equation (14) is equivalent to the equation 

oo 

E P(n\X) = 0. (21) 

n=n obs +l 

So we see that the summation in our modified equation (21) starts from n obs + 1 whereas 
in classical frequentist equation (8) the summation starts from n b s - Classical frequentist 
equation (8) is equivalent to the Bayes equation (5) with prior 7r(A) ~ j. 
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Note that for the case of continuous variable x ( +00 > x > —00) our inequality (12) 
takes the form 

1 - > I* " 3 f(x'\X)dx > a . (22) 
J —00 

Here f(x, A) is the probability density function and A is some unknown parameter. The 
equations for the determination of X up and Xdown are 

/ x obs f 1 , 

f(x \X up )dx = a , (23) 
-00 

r°° , , 1 

\ f(x \X down )dx = ft . (24) 

x obs 

The equations (23,24) coincide with classical Neyman belt equations. 

It is possible to generalize our modified frequentist definition (12), namely: 

1-0 > P_{n ohs \X-k) >a, (25) 

where 

n obs J t~k 

P4n obs \X;k)= J2 P ( n \ X ) ( 26 ) 

n=0 

and k = 0,±1,±2,... 

One can find that formulae (25,26) lead to Bayes equations (4,5) with the prior function 
7r(A) ~ X k . We can further generalize formulae (25,26) by the introduction 

P- {n obs |A;c fe ) = Yj c I P - ( n obs I A; fc) , (27) 
k 

where J2k c t = 1- Again we require that 

1-0 >P-{n obs \X-c k ) >a. (28) 
One can find that our definition (28) is equivalent to Bayes approach with prior function 

^(A) = £4U\ (29) 

k 

where 

(n + k)\ 
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For the case when we have nonzero background the parameter A is represented in the 
form 

X = b + s. (31) 

Here b > is known background and s is unknown signal. In Bayes approach the gener- 
alization of the formula (2) reads 



/ | t \ P(n obs \b + s)ir(b, s) 

Jo {P{n obs \b + s )ir(b,s)ds 



For flat prior we have 



f i h\ P(n obs \b + s) 
P{sKbs > b) = fr(P(n obs \X)d\>- (33) 

So we see that the main effect of nonzero background is the appearance of the factor 

K(n obs ,b) = / P{n obs \\)d\ (34) 

J b 

in the denominator of formula (33). For zero background K(n obs ,b = 0) = 1. One can 
interpret the appearance of additional factor K(n obs , b) in terms of conditional probability. 
Really, for flat prior the P(n obs , X)d\ is the probability that parameter A lies in the interval 
[A, X+dX]. For the case of nonzero background b parameter A = b+s > b. The probability 
that A > b is equal to p(X > b\n obs ) = K(n obs , b). The conditional probability that A lies 
in the interval [A, A + dX] provided A > b is determined by the standard formula of the 
conditional probability 

p(X, n obs X > b)dX = dX = — -dX (35) 

p(X > b) K(n obs s) 

and it coincides with the Bayes formula (33). 

In the frequentist approach the naive generalization of the inequality (12) is 

1-0 > P-(n obs \s + b) >a. (36) 

One can show that 

l-a'-0= f" +SuP P(n obs \X')dX' < f°° P(n obs \X')dX' . (37) 



However the main drawback of the definition (36) is that the probability that the signal 
s lies in the interval < s < oo is equal to J b °° P(n obs \X')dX' and it is less than unity for 



nonzero background s > that contradicts to the intuition that the full probability that 
the signal s lies between zero and infinity must be equal to unity. To cure this drawback 
let us require that 

1 ~ P-{n obs \b) ~ 1 ; 

The inequality (38) leads to the equations for the determination of s down and s up which 
coincide with the corresponding Bayes equations. The generalization of the inequalities 
(38) is straightforward, for instance the inequality (28) reads 



1-/3 > „ ' , V > a . (39) 
P-{n obs \b; c k ) 

Upper limit on the signal s derived from the inequality (38) coincides with the upper limit 
in CL S method [7, 8]. 

Note that frequentist equations (8,9) for X up = Xdown don't satisfy the evident equality 
a + /3' — 1. The natural generalization of the equations (8,9) looks as follows 

P-i{n bs\K P ) = ol , (40) 

P+l(n bs\^down) = , (41) 



where 



P-i(n obs \X)= P N A ) +^K&*|A), (42) 

n=0 1 
oo y 

P+iK bs |A)= J2 P(n\\) + -P(n obs \X) . (43) 

n=n obs + l 



Note that 



P-!(n obs \X) + P +1 (n obs \X) = 1 (44) 

and a + — 1 for X up = Xdown- The equations (40,41) are equivalent to the Bayes 
equations (4,5) with prior 

T(A) = i(l + ^). (45) 
The modified frequentist definition (12) takes the form 

1-0 >P-i(n obs \X)>a . (46) 
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To conclude let us stress our main result. For Poisson distribution we have proposed 
modified frequentist definition of the confidence interval and have shown the equivalence 
of the modified frequentist approach and Bayes approach. 

This work has been supported by RFBR grant N 10-02-00468. 
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